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ABSTRACT 

Ongoing and future imaging surveys represent significant improvements in depth, area and seeing 
compared to current data-sets. These improvements offer the opportunity to discover up to three 
orders of magnitude more galaxy-galaxy strong lenses than are currently known. In this work we 
forecast the number of lenses discoverable in forthcoming surveys and simulate their properties. We 
generate a population of statistically realistic strong lenses and simulate observations of this population 
for the Dark Energy Survey (DES), Large Synoptic Survey Telescope (LSST) and Euclid surveys. We 
verify our model against the galaxy-scale lens search of the Canada-France-Ha waii Telescope Legacy 
Survey (CFHTLS), predicting 250 discoverable lenses compared to 220 found by Gavazzi et al. (2 014|. 
The predicted Einstein radius distribution is also remarkably similar to that found by |Sonnenleld et 
al. (2013}. For future surveys we find that, assuming Poisson limited lens galaxy subtraction, searches 
in DES, LSST and Euclid datasets should discover 2400, 120000, and 170000 galaxy-galaxy strong 
lenses respectively. Finders using blue minus red (g — i) difference imaging for lens subtraction can 
discover 1300 and 62000 lenses in DES and LSST. The uncertainties on the model are dominated 
by the high redshift source population which typically gives fractional errors on the discoverable lens 
number at the tens of percent level. We find that doubling the signal-to-noise ratio required for a lens 
to be detectable, approximately halves the number of detectable lenses in each survey, indicating the 
importance of understanding the selection function and sensitivity of future lens finders in interpreting 
strong lens statistics. We make our population forecasting and simulated observation codes publicly 
available so that the selection function of strong lens finders can easily be calibrated. 

Subject headings: gravitational lensing: strong 


1. INTRODUCTION 

Strong gravitational lensing by galaxies can be used 
to probe both astrophysics and cosmology. To date sev¬ 
eral hundred galaxy-galaxy strong lenses have been dis¬ 
covered in heterogeneous searches of photometric and 
sp ectroscopic survey data (|Myers et al.|2QQ3 Gavazzi et 
aj7| 20121 IBolton et al.||2QUo| |lnada et al.||2012| |JNegrello 


et al. 2014). The sample has been used to constrain 


vestigating trends in the mass-to-light ratio of galaxies, 
the distribution of dark-matter in galaxies and the dark- 
substructure population will much more tightly constrain 
the nature of dark matter, the initial mass function and 
galaxy formation physics than is currently possible (e.g. 
Vegetti et al.||2014 iSonnenfeld et al.||2013 ). 

rveys also have the potential f 


Future surveys 
population of exotic 


the masses and density pr ofiles of galaxies (Auger et al. 
2010]; Barnab e et al.|2011 ), the dark sub-halo population 
(Vegetti et al. 201 4|" cosmological parameters d Collett fe 

Auger|2014| |Ogun e 


For example, compound lenses are 


potential to discover a 
and rare- strong lenses systems. 


powerful probes of 


|2014[ Uguri et al.|2012; Suyu et al. 2014}, the high 

ft him 


e-Nugent et al. 201jj) 


redshift luminosity function (|Darone- 

and the nature of high reds hift sources (e.g. IlNewton et 
al.|2011| |Quider et al.|2009|) . For many of these analyses 
the shortage of suitable strong lenses is 


parameters (ICollett et al. 2012} as are strongly-lensed su- 
pernovae dRefsdal 1964} and strong lensing catastrophes 

(Urban de Xivry fe Marshall 

2009|, but only a handful 

of these systems are known 

Gavazzi et al. 2008 Kelly 

et al. 2015 Quimby et al. 2C 

)14), which limits the pre- 


strong lenses is a major limiting cision o Tcosmological constrain ts achievable with strong 


shortage 
factor. 

Several ongoing, and near-future, wide and deep sky 


pared to existing data (|Diehl et al. 2014 

Miyazaki et 

al. 2006 Ivezic et al. 2008j; Laureijs et al. 

2oi ij. ttiiw 


scale lens sample by order s of magnitude (Kuhlen et al. 
2004 Marshall et al.||2005[ ). With a large increase to the 


known strong lens population, current work could be ex¬ 
tended to new regimes; lower lens masses, higher redshift 
lenses and intrinsically fainter sources. In turn this will 
allow for investigations into the luminosity and redshift 
trends of lens and source properties. For example, in- 
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lensing (Collett & Auger 2014). 

Additional to science using sub-samples of lenses, the 
size and properties of the full ensemble of strong lenses 
is dependent on both cosmological para meters and the 
source and deflector populations (e.g. Oguri et al.|2012); 
future surveys will provide a much more powerful sample 
for strong lensing statistics. 

Recent analytical work has forecast the number of 
lensed point so urces like ly to be discovered in spectro- 
scopic surveys ([S erjeant 2014) and in photometric sur¬ 
veys (|Oguri & Marshall 2010]), however making fore¬ 
casts for lenses with extended background sources re¬ 
quires one to move beyond analytic analyses due to dif¬ 
ferent ialjnagnification across the source plane. [Doblcr 
et al. (2008} simulated lensing of spectral line emission 

































































































































2 


Collett (2015) 


regions and then simulated observing such objects with 
the SDSS spectrograph to model the selection function in 
the SLACS sur vey. By changing the source population to 
galaxies filbert et al. 2009) and the transfer functio n to 
that of forthco ming surveys (e.g. Chang et al.||2015) the 
methodology of Dobler et al.| (2008|) can be extended to 
predict galaxy-galaxy strong lensing rates in forthcoming 
photometric surveys. 

The goal of this work is to answer the question “How 
many galaxy-galaxy lenses can forthcoming imaging sur¬ 
veys potentially discover?” To answer this question we 
first build a population of realistic strong lenses (Sec¬ 
tion [2| and then simulate observations of these lenses 
(Section [3]) for four recent and forthcoming photometric 
surveys; (JFHTLS, DES, LSST and Euclid. Of course 
actually discovering strong lenses in these surveys will 
requi re the development of new methods and algorithms 




. Gavaz zi et al. 2014 Brault & Gavazzi 2014 Kiing 
] 2015|), which is beyond the scope of this article, 
4ns woi 


et al 

but this work is intended to motivate such developments 
by providing both realistic expectations for lens finders 
and realistic simulations to test lens finders upon. We 
also investigate how the features of lens finders can af¬ 
fect the number of strong lenses that they can discover, 
paying attention to how they alter the effective seeing 
and signal-to-noise of the data. In Section [7] we discuss 
our results in the context of the uncertainties inherent to 
how we have constructed the lens population and how we 
have assumed future lens finders might work. We sum¬ 
marize and conclude our results in Section [8j Through¬ 
out this paper we assume a flat ACDM cosmology with 
Qm = 0.7, and h = 0.7. 


is the ratio of semi-major and semi-minor axes( i.e. the 
flattening). Einstein radii of SIEs are given by the stellar 
velocity dispersion of the galaxy, ay, and angular diam¬ 
eter distances between observer, lens and source (Dij) 
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( 2 ) 


We neglect the lensing contribution of matter along the 
line-of-sight. 

For the light profile of the deflector, we assume an ellip¬ 
tical de Vaucolours profile concentr ic with , jmd a ligned 
with, the mass (e.g. as observed by Gavazzi et al.| [2008 


Kostrzewa-Rutkowska et al.|2014). We make the approx¬ 
imation that the effective radius, R e , of the deflector is 
the same in all observed bands, but allow the absolute 
magnitude M in each band to vary. For the colors of the 
lens we assume the rest-frame SED of a passive galaxy 
whose star formation was a single burst 10 Gyrs ago. 

Given these simplifying assumptions, our model for the 
deflector is thus described by five parameters; the lens 
redshift, cry, q , R e , and M r . Rotational, and transla¬ 
tional symmetry allow us to place the deflector at the 
center of the coordinate system and align the semi-major 
and coordi nate x-axes. 

Choi, Park, fc Vogeley (2007") used Sloan Digital Sky 


Survey (SDSS; j York et al.|2000|) data to derive the veloc¬ 
ity dispersion function of elliptical galaxies in the local 


ity dispersion 
Universe, finding 
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2. CREATING A POPULATION OF REALISTIC MOCK 
GALAXY-GALAXY STRONG LENSES 

Strong gravitational lensing occurs when a massive 
foreground object is sufficiently well aligned with a back¬ 
ground source that the lens equation has multiple solu¬ 
tions and hence multiple images of the source can form. 
The population of strong lenses therefore depends on the 
population of massive objects, background sources, and 
the geometry of the Universe. Once the lens population is 
known individual strong lens systems can be drawn from 
the population and simulated to assess if strong lens¬ 
ing detectable. Simulating individual systems requires 
knowledge of the light profile of the lens and source, the 
density profile of the source and the angular diameter 
distances between observer, lens and source. 

2.1. The foreground deflector population 
Observations have shown that the mass profiles of ellip¬ 



soids (SIEs). The density profile of an SIE is given by 

^ - 2^' < 4 > 

where ay is the velocity dispersion of the lens, r is the 
elliptical distance from the center (r 2 = x 2 /q + qy 2 ), q 


-1 


where </>* = 8.0 x 10 -3 h 3 Mpc -3 , a* = 161 kms 
a = 2.32 and fl = 2.67. In our model we assume that 
neither the shape nor the normalization of this function 
vary wit h redshift; this is con sistent with the observa¬ 
tions of Bezansonet al. (2011). 

From Equation [3] we can draw a velocity dispersion for 
each deflector to be simulated and a redshift from the dif¬ 
ferential co-moving volume function dVol/dz. The other 
parameters of our lens model, q, R e and M are covariant 
with the redshift and velocity dispersion. The fundamen¬ 
tal plane of elliptical galaxies allows us to infer R e and 
M given oy; we use the functiona l form and scatter as 
derived in Hyde & Bernardi (2009). The final parameter 


is the flattening, which is known to correlate with ay 
more massive galaxies tend to be closer to spherical than 
less massive ones. This trend is clearly seen in the SDSS 
data, which we fit with a Rayleigh distribution, where 
the Rayleigh scale parameter s is linear in ay 


P (1 - q\s = (A + Bay)) = —^ exp 


2s 2 


(4) 

The fit values are A = 0.38 and B = 5.7 x 

10 -4 (kms -1 ) -1 ). To exclude highly-flattened mass pro¬ 
files, we truncate the Rayleigh distribution at q = 0.2. 


2.2. The background source population 

For the lensed background sources we need only know 
their redshifts, light profiles, colors and absolute mag¬ 
nitudes. Since typical lensed sources are at high red¬ 
shift, the light profiles and spectroscopic redshifts of 
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Fig. 1.— The properties of the ideal lens sample i.e. all galaxy-galaxy lenses in the Universe with a deflector with cry > lOOkm/s and a 
source with unlensed i < 27. The left panel shows dlogn/d#E- The middle panel shows the distribution of lens redshifts as the solid line 
and the distribution of source redshifts as the dashed line. The right panel shows the number of lenses in each ay bin of width 10 km/s 
and the dashed line shows a Gaussian fit to this distribution, with mean 210 and width 54 km/s. 


the unlensed source population is often beyond the ca¬ 
pability of current telescopes. Neglecting intrinsically 
faint, highly magnified sources could potentially cause 
us to significantly underestimate the detectable strong- 
lens population; we therefore use the s ky catalogs s im¬ 
ulated for th e LSST collabor a tion b y |Connolly et al 
(|2010j). The |Connolly et al.| (|2010|) catalog is gener¬ 
ated by ray-tr acing through the M illennium dark mat 


ter simulation (Sprin gel et al.| 2005), with galaxies a dded 
using the semi-analytic prescription of |De Lucia et ah 
(2006) adjusted to mimic the luminosity and color dis¬ 
tributions of low redshift galaxies and the redshift and 
number-magnitude distributions derived from deep imag¬ 
ing and spectroscopic surveys. The final catalog com¬ 
prises a 4.5x4.5 square degree footprint on the sky and 
samples halo masses down to 2.5 x 10 9 M o . Due to the 
resolution limit of the simulations, this catalog is only 
complete down to i ~ 27.5, which may be insufficiently 
deep to perfectly reconstruct the faintest lens popula¬ 
tion of LSST but sufficient for the other surveys that we 
investigate here. 

For all sources, we assume that the light profile is given 
by an elliptical exponential profile with ellipticity drawn 
from a Rayleigh distribution with scale parameter s = 
0.3, truncated at q = 0.2. For the effective radius of the 
profile we assume 


io6i “ (iU 


My 

-19.5 


- 0.22 


l.+Z 

5. 


- 1.2 


+ N(0.3) 

(5) 

where N( 0.3) is a Gaussianly distributed random vari¬ 
able of scatter 0.3. The relation and scatter are in line 


with t he results of Mosleh et al. (2012) and Huang et al. 
(2013) which both show trends towards more compact 


sources at higher redshifts. 

2.3. The properties of the realistic lens systems, before 
observing with a telescope 

Gi ven t he l ens and source populations generated in sec¬ 
tions 2.1 and 2.2, we can project the lensing cross section 
back onto the source population, giving the expected dis¬ 
tribution of lens and source properties for an idealized 
survey. Our ideal model predicts 11 million lenses on the 
whole sky, with a lens velocity dispersion ay > lOOkm/s 
and a background source with i < 27. The Einstein 
radius distribution, deflector velocity dispersion distri¬ 
bution and the source and lens redshift distributions are 
shown in Figure []] The model predicts that the pop¬ 
ulation of strong lenses is dominated by galaxies with 


ay « 200 km/s and the distribution is well approximated 
by a Gaussian with mean 210 km/s and one sigma width 
of 54 km/s. The majority of lenses have Einstein radii 
around 0.5 arcseconds and sources close to the complete¬ 
ness limit of the source catalog. Even after strong lensing 
magnification most sources are too faint to be detectable 
by Euclid, and the Einstein radii are too small to be 
resolvable with LSST. 

3. SIMULATED OBSERVATIONS OF STRONG LENSES 
WITH SURVEY TELESCOPES 

In practice it is impossible to observe the idealized 
noiseless, unconvolved lenses produced by our model; 
real observations of strong lenses will involve the convo¬ 
lution with a point spread function, and various sources 
of noise. To investigate the discoverable lens population 
in future surveys, we must therefore include the transfer 
function of each survey telescope. We simulate observa¬ 
tions by CFHTLS, DES, LSST and Euclid. 

Our prescription for the transfer function is a simple 
one, but it encapsulates all the significant features rel¬ 
evant to lens finding. For each survey, the ideal lenses 
are evaluated on a pixellated grid with pixel-scale equal 
to the detector pixel-seal^] then convolved with the cir¬ 
cularly symmetric PSF discretized to the sample pixel- 
scale. We then simulate the noise assuming Poisson noise 
from the lens, source and a uniform sky background 
and a constant read-noise for each exposure. We ne¬ 
glect cosmic-rays, artifacts and ghosts. For complete and 
on-going surveys, the seeing, sky-brightness, zero-points, 
exposure-times, number of exposures, pixel-scale, read- 
noise, filter set and survey area are taken from the ex¬ 
isting observations. For future survey s we u se the LSST 
observation simulator (| Connolly et al 2010), and the Eu¬ 
clid design parameters. The assumea survey parameters 
are summarized in Table [lj 

The seeing and sky-brightness are stochastic parame¬ 
ters that vary significantly over the course of a survey, 
hence for each simulated exposure we draw from the real 
(or expected) distribution of seeing and sky-brightness 
observed in each filter set. This point is significant to 
the expected number of discoverable strong lenses for 
two reasons. Firstly, survey strategies are optimized to 
take bluer-band data in moonless skies and redder-band 
data when the moon is up. Secondly, survey strategies 
are often optimized for specific science goals, for example 
DES preferentially takes r- and i-band data in good see- 

1 We subsample the pixel grid by a factor of 25 in regions where 
the source profile varies quickly 
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Fig. 2.— A simulated lens as would be observed by forthcoming telescopes. From left to right, the top row shows a gri color composite 
image for the full-depth co-add of CFHT, DES and LSST. The top right shows the Euclid image. From left to right the second row shows 
the g — od difference image corresponding to the color composite above. Since Euclid has only one optical band, the second row far right 
shows the VIS residual assuming the galaxy can be subtracted perfectly (although the noise on the galaxy light remains). From Left to 
right, the third row shows the gri composites for the best single epoch DES imaging, the optimally stacked DES imaging, the best single 
epoch LSST imaging, and the optimally stacked LSST imaging. The fourth row shows the g — ai difference image corresponding to the 
color composite above. Our signal-to-noise detectability criteria are satisfied in all images, but we do not consider the lens to be detectable 
in the CFHT, DES and LSST full co-adds as the arcs are not resolved. The CFHT full-stack only marginally fails the resolution criteria, 
whilst the optimally stacked LSST image only marginally satisfies it 
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ing^to improve the weak lensing analysis (Diehl et al. 
2014). Survey strategies therefore lead to seeing and 
sky-brightness distributions that are often strongly cor¬ 
related, and not random draws from the seeing and sky 
brightness distributions of the telescope site. The most 
significant feature for lens finding is the strategy used for 
allocating good and bad seeing time^ Si nce t he major¬ 
ity of background sources are blue ([Gavazzi et al.||2014) 
and the Einstein radius distribution oFoiir lenses peaks 
at ^0.5 arcseconds, strategies that penalize g-band see¬ 
ing are likely to limit the potential for discovering strong 
lenses. 

In Figure [2] we show an example of a simulated lens as 
would be observed with each of CFHT, DES, LSST and 
Euclid. 

4. DEFINING A DETECTABLE LENS 

Having generated a population of strong lenses and 
simulated observations of them, we have ^10 9 expo¬ 
sures. Whilst each exposure contains a simulated strong 
lens, not all of them can plausibly be detectable as a 
strong lenses. Systems with very low signal-to-noise or 
where the Einstein radius is much less than the see¬ 
ing are unlikely to be distinguishable as strong lenses. 
Robotic st rong lens finders already exist (e.g. Marshall 
et al. 2005), but require a large amount of survey specific 
tuning and a realistic training set. Most imaging-based 
strong lens searches require at least some level of human 
input. Human classifications introduce a stochastic ele- 


not insist that the source and lens be significantly differ¬ 
ent colors as color infor mation may not be required for 
future lens finders (e.g. Joseph "et al.| |2014). 

Our criteria for a detectable strong lens are that the 
center of the source is multiply imaged 

0 % >x 2 s + yi ( 6 ) 

where 6e is the Einstein radius and (x s ,y s ) are the un- 
lensed source position. The image and counter-image 
must be resolved 


0% > r 2 + s 2 / 2, 


(7) 


ment (e.g. Gavazzi et al. 2014), but a human expert tends 
to look for defining features when assessing the probabil¬ 
ity of strong lensing. The patterns a human lens finder 
looks for are typically arc-like features, the presence of 
a similar-color counter image on the opposite side of the 
lens, features that are a different color (typically bluer) to 
the putative lens galaxy, and a morphology that is plau¬ 
sible fit by a strong lens model. Systems that meet all 
of these criteria are typically classified as A-grade strong 
lens candidates; B-grade candidates often show arc-like 
features without an obvious counter image, or multiple 
same-color sources too small to show tangential shearing. 
The final confirmation of strong lensing typically requires 


where s is the seeing and r s ,(x s ,y s ) is the unlensed 
source size. Tangential shearing of the arcs must be de¬ 
tectable; 

Mtot^s > s, p-tot > 3. (8) 

where //tot is the total magnification of the source. We 
also require that the source be detected with sufficient 
signal to noise for a human to recognize that Equations 
[6j [7] and [5] are satisfied. 

SN T ot ^ 20. (9) 

where SNtot is the total signal to noise of the lensed 
residual. 

We insist that all detectable lenses obey Equation [6] 
and that Equations [7] through [9] be satisfied in at least 
one imaging band. This is more constrictive than the 
simplest definition of a strong lens (multiple imaging of 
some part of the source), however we would be surprised 
if future lens finders can discover a significant popula¬ 
tion of galaxy-galaxy lenses that do not pass our selec¬ 
tion cuts. The one class of galaxy-galaxy lenses that will 
be discoverable but does not pass our criteria are sys¬ 
tems with an unresolved source whose image morphology 
is still sufficient to detect lensing i.e. quadruple image 
systems with sufficiently large Einstein radii and a suf¬ 
ficiently blue source to resolve the images from the lens 
galaxy; this population is likely to be small. 

5. CFHTLS: TESTING THE MODEL AGAINST A WELL 
STUDIED LENS SAMPLE 


either spectroscopic confirmation of the source redshift, 

The CFHTLS has been extensively 

searched for strong 

or a compelling lens model. Whilst spectroscopy is ex¬ 

lenses, both visually (More et al. 

2015|) and 

semi- 

pensive, we optimistically assume that any system obvi- 

automatically (Gavazzi et al. 2014| 

More et al. 

2012) 


lens, given that all of our systems can be fitted with a 
strong lens model. 

For an SIS strong lens, the center of the counter images 
are separated by twice the Einstein radius th e magn ifi¬ 


cati on is purely tangential (Schneider, Ehlers, & Falco 


1992). Thus the quadrature sum of the seeing and twice 


the source size must be less than twice the Einstein ra¬ 
dius, or counter images will be unresolved. For tangential 
shearing to be observable the source must be resolved in 
the tangential direction, i.e. the product of the source 
size and the magnification must be greater than the see¬ 
ing, additionally the magnification must be sufficiently 
large that the source is noticeably sheared. Based on vi¬ 
sual inspection of lensed galaxy subtracted residuals we 
assume that a magnification of at least three and a sig¬ 
nal to noise of at least 20 is required to unambiguously 
recognize arcs. By definition our strong lenses show a 
morphology consistent with strong lensing, but we do 


ing from galaxy to group scales. Our simulations gener¬ 
ate the t ypes of lenses that form the SL2S gal axy-scale 
sample of|So nnenfeld et al. (2013), discovered by Gavazzi 
et al. (2014|) using the RINGFINDER algorithm. Apply¬ 
ing Ringfinder to the real data yielded 401 candidates 
of which 49 are spec troscopically co nfirmed. Based on 
incomplete follow-up Gavazzi et al. (2014) predicts 220 
discoverable lenses in CEHT. Applying our detectability 
criteria to our simulated CFHTLS co-adds, we forecast 
800 discoverable strong lenses. However, after applying 
the PSF cro ss-convolution and q — ai difference imaging 
approach of Gavazzi et al. (2014) we would only classify 
290 of our candidates as still discoverable. Roughly half 
of lost systems are due to the poorer seeing in the g — ai 
difference image and the other half are lost due to the 
degrad ed signal-to -noise. After applying the lens galaxy 
cuts of |Gavazzi et al.| ([2014]), the number of detectable 
lenses decreases to 230 systems, in good agreement with 


















































6 


Collett (2015) 


TABLE 1 

Properties of large area imaging surveys considered in this work. These are fiducial numbers based on assumptions we 

DESCRIBE IN THE TEXT; GIVEN ARE SURVEY AREA Q, EFFECTIVE COLLECTING AREA, FILTER-SET, MEDIAN SEEING IN EACH FILTER (FWHM), 
MEDIAN SKY BRIGHTNESS IN EACH FILTER, TOTAL EXPOSURE TIME PER FILTER, AND PIXEL-SCALE. 


Survey 

Q 

[deg 2 ] 

Collecting area® 
[m 2 ] 

Filters 

Seeing 6 

[arcsec] 

Sky 6 

Exposure time 

[s] 

Pixel-scale 

[arcsec] 

CFHTLS 

150 

8.8 

{g,r,i c } 

{0.83, 0.79, 0.69} 

{21.7, 20.8, 19.7} 

{3500, 5500, 5500} 

0.187 

DES 

5000 

9.6 

{g,r,i c } 

{1.24, 1.05, 0.96} 

{21.7, 20.6, 19.4} 

{900, 900, 900} 

0.263 

LSST 

20000 

35 

{g,r, i c ) 

{0.81, 0.77, 0.75} 

{21.7, 21.1, 20.0} 

{3000, 6000, 6000} 

0.18 

Euclid 

15000 

1.0 

{VIS d } 

{0.18} 

{22.2 e } 

{1610 e } 

0.1 


a We do not directly use the co llecting area in our analysis, we instead use the instrumental zero-points for each survey for CFHT 

(Boulade et al. 2003| and DES (Diehl et al. 2008]), for LSST we rescale the DES zero-points by the ratio of collecting areas. For Euclid we 

assume a zero-point of 25.5 AB (S. JNiemi, Private communication) 

6 Median quantities shown 

c We only include these filters in our strong lensing model. We approximate g , r and i filters with their SDSS counterparts 

d We approximate VIS magnitudes as VIS = (r + i + z)/3 (S. Niemi, Private communication) 


e There is some variation in sky-brightness due to zodiacal and scattered light, but we adopt 22.2 as the VIS sky brightness for all Euclid 
pointings. We also neglect the fact that half of the Euclid area will have 2150s total exposure time due to the dithering pattern. (S. 
Niemi, Private communication) 



Fig. 3.— The Ein stein radiu s of t he SL2S galaxy-scale lens sam¬ 
ple, as fit by|Sonnenfeld et al.|([2013 ) shown in the gray bars and the 
Einstein radii distribution forecast by our model shown by the black 
dashed line, assuming a Ringfinder -like search of our simulated 
SL2S survey. The area under the black dashed line corresponds to 
230 lenses. The gray-bars have been scaled to have a total weight 
of 220 systems, equally up- weighting each of the 52 lens es modeled 
by|Somienfeld et al. | ( |2013| ) to match the predictions of |Gavazzi et| 


the results of Gavazzi et al. 
pect our forecast and the CF 


(2014). We should not ex- 
TT results to agree this well; 


Ringfinder can find lenses with /a < 3 if the source is 
intrinsically tangential to the lens, and does not have 
strict criteria on total signal-to-noise or on the seeing, 
both potentially increasing the discoverable lens popu¬ 
lation. Conversely, some of our simulated lenses that a 
human would consider obvious candidates are missed by 
the automate d part of Ringfinder . This was found by 


Gavazzi et al. (2014) where 77 serendipitously discovered 
probable lenses during the development of Ringfinder 
that were not part of the final statistical sample of 330 
candidates. 

In Figure [3] we compare the Einstein radii of the fore¬ 
cast 2 00 lenses with the values inferred by Sonnenfeld 
from modelling the SL2S sample and show 
HIstribution of Einstein radii are similar, thus 


. (2013} 
the dis 


et al 
that 

our seeing criteria (Equations [7] and |8| is reasonably rep¬ 
resentative of what is achievable witn an automatic ring¬ 


finder. 

Given the similarity of both the number of discov¬ 
erable lenses and the Einstein radius distribution with 


that foun d in Sonnenfeld et al. (2013) and Gavazzi et 
al. (2014) we are confident that our lens population and 
detectability criteria are good representations of what 
is discoverable in the real Universe. In the rest of this 
work we will report the number of lenses detectable in 
one band assuming Poisson limited galaxy subtraction 
(800 in the CFHTLS co-adds) as a target for future lens 
finders. We will also report the number of lenses discov¬ 
erable in the g — ai difference image, although we will not 
apply the Ringfinder cuts on lens redshift and Fband 
magnitude and assume the PSF of the difference image 
can be set to the worst of the g- or i-band PSFs rather 
than cross-convolving the PSFs. These seem plausible 
upgrades to Ringfinder and gives a forecasts of 480 
lenses discoverable in the CFHT co-adds. This forecast 
represents a more pessimistic (but more achievable) view 
of where lens finding is likely to be in the next decade. 

6. DES, LSST AND EUCLID: ON-GOING AND FUTURE 
LENS SURVEYS 

We now consider the number of detectable lenses in 
the DES, LSST and Euclid wide surveys. In the final 
co-added images of each survey we forecast that DES 
can discover 1400 lenses, LSST can discover 39000 lenses 
and Euclid can discover 170000 lenses. Requiring that 
the lenses be discoverable in the g — ai image decreases 
these numbers to 800 and 16000 for DES and LSST re¬ 
spectively. Since Euclid has only one visible filter it is 
not possible to construct an optical blue minus red image 
so no lenses can be found this way; it may be possible to 
use the near infra-red channels although they have de¬ 
creased depth and resolution, or to use lower resolution 
multi-band optical data to add color information to the 
VIS imaging (e.g. Loncan et al. 2015), but we have not 
investigated these possibilities. 


6.1. Alternative strategies for combining multiple 
exposures 

Each object in these surveys will be imaged many 
times; individual exposures can be combined to make 
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TABLE 2 

The predicted number of discoverable lenses using the model described in this work applied to current and future large 

AREA SURVEYS. 


Survey 

Criteria 0, 

Full Stack 
g,r or i g — ai 

Best Epoch 
g,r or i g — ai 

Optimal Stack 
g,r or i g — ai 

DES 

LSST** 

Euclid 0 

1400 

39000 

170000 

800 

16000 

0 d 

800 

17000 

340 

3000 

2400 

120000 

1300 

62000 


a The detectability criteria applied; for the first column is that discoverable systems must satisfy Equations [6] through [9] in at least one 
band assuming Poisson noise-limited galaxy subtraction, whilst for the second column the criteria must be satisfied in the g — ai 
difference image with the PSF of the g and i images matched to have the same seeing. The true number of lenses that forthcoming 
surveys will discover is likely to be between these two values. 

b Our source population is insufficiently deep to include high magnification lenses with intrins icall y ultra-faint (i > 27) sources. The 
numbers in the full and optimal stack columns may therefore be underestimates - see Section |7.3| 

° Since we assume all Euclid exposures have the same PSF and background, stacking all the exposures gives maximal signal to noise. 
d Since Euclid has only one visible-light filter, constructing an optical blue minus red image will not be possible. 



25000 



0 E (arcsec) 


Fig. 4.— The Einstein Radius distribution for lenses discoverable in DES (left panel) and LSST (right panel). The red distributions 
show the Einstein radius distribution of the lenses discoverable in the full stack of the final survey, whilst the blue distributions show the 
population discoverable in the best single-epoch imaging. The gray distribution shows the Einstein radius distribution of the lenses that 
can be discovered if one searches for lenses in all possible (seeing-ordered) stacks for each survey. The dashed black line in the right panel 
shows the Einstein radius distribution for galaxy-galaxy lenses discoverable in Euclid. 


a deeper, stacked image, however for ground based im¬ 
ages the exposures will not have comparable seeing; with 
some exposures having much higher resolution than oth¬ 
ers. Since our detection criterion requires the seeing to be 
less than the Einstein radius, adding poor seeing images 
to the stack can decrease the discoverable lens popula¬ 
tion. Therefore for DES and LSST we investigate alter¬ 
native stacking strategies. A trivial alternative strategy 
to co-addition is to discard all exposures except the image 
with the best seeing in each filter, in effect trading signal- 
to-noise for resolution. This strategy yields 800 (17000) 
detectable lenses for DES (LSST) of which 340 (3000) are 
still discoverable in the difference image. Fewer lenses are 
discoverable in the best seeing images than the co-adds, 
but they are not the same lenses, as Figure [4] shows; the 
co-add imaging finds the higher mass lenses whilst the 
best single epoch imaging discovers the systems with the 
brightest lensed sources. 

Given our detection criteria, an optimal stacking ex¬ 
ists which only includes exposures where the seeing is 
sufficiently good for 0% > r 2 s + (s/2) 2 and /ir s > s to be 
satisfied. Since the source size and Einstein radius is not 
known a-priori performing this optimal stacking on a real 
data set is impossible, but is still physically meaningful: 
applying a lens finder to all possible stacks (ordered by 


seeing), only increases the search effort by a factor of the 
number of exposures per filter and naturally includes the 
optimal stack. Alternatively a lens finder that can simul¬ 
taneously model all the individual exposures without any 
stacking, could potentially outperform even this stacking 
strategy. With optimal stacking, and perfect galaxy sub¬ 
traction DES (LSST) can discover 2400 (120000) lenses; 
1300 (60000) are still detectable in the g — ai difference 
imaging. For LSST, 70000 - more than half - of the 
120000 lenses that are discoverable in the filtered stack 
will not be discoverable in either the best single epoch 
imaging or the final co-add. 

6.2. The properties of the lens populations 

Our model allows us to forecast more than the total 
number of discoverable strong lenses, it also allows us 
to forecast the properties of the population. This allows 
us to predict if forthcoming surveys will open up new 
regimes of strong-lens science. A-priori we expect that 
the high resolution imaging of Euclid opens up the pos¬ 
sibility of discovering a large number of low-mass strong 
lenses and the large area and depth, but comparatively 
poorer seeing, of DES and LSST will make them efficient 
at finding massive lenses. We see both these expected ef¬ 
fects in figure [5| where we plot the predicted velocity 
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Fig. 5. — The velocity dispersion distribution for the lenses dis¬ 
coverable in forthcoming wide area surveys. Black shows Euclid, 
green shows CFHT, blue shows LSST and red shows DES. Surveys 
with better seeing are able to discover lenses with lower Einstein 
Radii and hence probe intrinsically less massive objects. 



Fig. 6. — The redshift dispersion distribution for the lenses dis¬ 
coverable in forthcoming wide area surveys. The lens redshifts are 
shown by the solid lines, the source redshifts are shown by the 
dashed lines. Black shows Euclid, green shows CFHT, blue shows 
LSST and red shows DES. 


dispersion function distribution of each survey. 

Figure [ 6 ] shows the source and lens the redshift dis¬ 
tributions of discoverable lenses in each of the surveys. 
There is no significant variation in the distribution of 
source redshifts between the surveys, although this is 
possibly due to the fact that our model only includes the 
g , r and i-band filters for CFHT, DES and LSST which 
precludes large numbers of high redshift sources and Eu¬ 
clid’s shallowness will also precludes finding a high red¬ 
shift source population without extreme magnification. 
An analysis of the discoverable infra-red lens population, 
may see a more significant variation of source redshift 
with imaging depth. Despite being the shallowest survey 
considered, we find that Euclid finds the highest fraction 
of high redshift lenses (Figure^. For an isothermal lens, 
the Einstein radius is proportional to Di s /D s hence at 
fixed source redshift, higher redshift lenses have a smaller 
Einstein radii making them harder to resolve in ground- 
based imaging. However, since th e highest redshift lenses 


currently known are at z ~ 1.6 (Wong et al. 2014| 


van 


der Wei et al. 2013), Figure [ 6 ] suggests that both DEB 


and LSST are also capable ofaiscovering higher redshift 


lenses than currently known. 

7. DISCUSSION: ERRORS ON FORECAST NUMBERS 

Our forecasts are based on a well-motivated prescrip¬ 
tion for generating a deflector and source population, 
lensing the background source, simulated observations of 
the lens and criteria that define a detectable lens. All of 
these stages have required simplifying assumptions that 
have the potential to systematically bias the lens number. 
In this section we investigate how small perturbations 
to the model can affect the forecast number of resolved 
lenses discoverable in the optimally filtered g — ai filtered 
stacks of the mock DES survey. 


7.1. Deflector Population 

We have assumed that there is no evolution in the co¬ 
moving number density of deflectors and that the velocity 
dispersion function does not evo lve with redshift . This 
is co nsistent w i th th e results of|Bezanson et al.| (|2011) 
and 
the 


Shu et al. ( 2012 ), which show minimal evolution of 
VDF out to z=0.5. Measuring velocity dispersions at 
higher redshifts is observationally challenging, but it is 
possible that there is evolutio n of the mass i ve gal axy pop¬ 
ulation at higher redshifts. Mason et al. (2015]) investi¬ 
gated the evolution of the VDFTising the stellar mass as a 
tracer, fitting a model of the form log(cr/ a z= q) oc (1 + z )& 
and co-moving number density evolving as oc (1 + z) a , 
and found a = 2.46 ± 0.53 and (3 = 0.20 d= 0.07. For 
a = 2.46, ft = 0 . 2 , the number of lenses detectable in 
the DES g — ai difference images increases by sixty per¬ 
cent. However, the magnitude of this un certain t y is m ost 
likely overestimated, as the fits of Mason et al. (2015) are 
skewed by evolution at high redshift where our model 
already predicts no significant lens population. Given 
the evidence for no significant evoluti on out to z ~1.5 
(Bez anson et al.pOll Shu et al. 2012 ) the no-evolution 
model seems a much more realistic representation of the 
Universe. We also investigated how deflector ellipticity 
impacts the number of detectable lenses, and saw no sig¬ 
nificant change in the discoverable lens fraction when in¬ 
creasing the ellipticity of all lenses by ten percent. 


7.2. External Shear 

Strong lenses do not tend to live in isolation; elliptical 
galaxies tend to cluster hence the tidal field of nearby 
masses perturbs strong lens systems. Simulating the full 
environments of strong lenses is beyond the scope of this 
work, but we can investigate how significant this effect 
is using a reasonable toy model. Our toy model assumes 
that the two components of the shear 71,72 are inde¬ 
pendently drawn from a Gaussian of width a = 0.05; 
this effect is larger than wou ld bey expected for random 
lines of sight (e.g. Pace et al.| 2015) but is in reasonable 
agreement with t he exte rnal s hears found by modeling 
strong lenses (e.g. [Wong et ah||2011). Applying this toy 
model we find a five percent increase in the number of 
detectable galaxy-galaxy strong lenses in the DES g — ai 
difference images; this moderate shear effect on the selec¬ 
tion function implies that the comparatively large shears 
in photometrically selected galaxy-galaxy strong lenses 
is typical of the environment of elliptical galaxies rather 
than a selection bias. 
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Unlensed source g-band magnitude 


(20l5|) recently analyzed the median sizes of galaxies in 
HST from redshift 0-10, finding that it is proportional 
to (1 + z) -1-12 ; if we use this result rather than the 
r s oc (l + z) -1,2 assumed in Equation[5j we find that this 
makes sources somewhat larger at high redshift (9 per 
cent at 2 = 2), which decreases the DES detectable lens 
population by six percent. An additional effect is related 
to the morphology of the sources, which are likely highly 
irregular. Our assumption of an elliptically symmetric 
exponential profile for the source is overly simplistic and 
if most of their flux originates from high density knots 
of star formation, their peak surface brightness may be 
larger and hence they may be more easily detected than 
our simple model predicts. 


Fig. 7. — The distribution of unlensed source g-band magnitudes 
for strong lens systems discoverable with LSST. The red histogram 
shows the population discoverable in the full LSST stack, whilst 
the gray population are discoverable if one searches for lenses in 
all possible (seeing-ordered) stacks. The shap e of the distributio n 
of g-band magnitudes for the source catalog ( |Connolly et a l. 2010) 
is shown as the dashed black line. Most lenses discoverable with 
LSST are intrinsically much brighter than the completeness limit 
of the source catalog. There is therefore unlikely to be a large 
population of discove rable strong lens es whose sources are too faint 
to be included in the |Co nno lly et al.| (2010) catalog. 


7.3. Source population 

Our LSST forecasts are underestimated, since our un¬ 
lensed source catalog is of comparable depth to the LSST 
co-add. In figure [7] we show that this is not a major effect; 
for most discoverable lenses the sources are brighter than 
the completeness limit of the source catalog. Given the 
luminosity function, this result appears counter-intuitive 
however faint sources are also small, hence few intrin¬ 
sically very faint galaxies can be resolved in the LSST 
imaging. The source catalog has been matched to the 
number counts of deep surveys, so the number of poten¬ 
tial background sources should be robust however there 
is scope for small changes to the number of detectable 
lenses, since the source pop ulation colors do not match 
the observed population (De Lu cia et aLp OOb) and the 
redshift distribution of sources fainter than i >~24 re¬ 
quires extrapolation. 


The C OSMOS photometric redshift catalog (Ilbert et 
al. 2009) is complete down to i = 25 and has muitU 
band HST photometry; this catalog implicitly includes 
the complex distribution of source colors and hence pro¬ 
vides a more realistic representation of the bright source 
population. Using the COSMOS photometric redshift 
catalog, we predict 760 lenses discoverable in the DES 
g — ai difference images. The original model had 840 
lenses with unlensed source magnitudes of i < 25. Red¬ 
shift catalogs much deeper than i = 25 do not exist, 
hence the redshift distribution of faint sources is poorly 
constrained; the uncertainty on the number of lenses with 
intrinsically faint sources is likely to be larger than the 
ten percent found for sources with z < 25, but we cannot 
precisely quantify this. 

The intrinsic sizes of the unlensed source population is 
a key parameter in predicting the detectable lens pop¬ 
ulation, since it sets the maximum magnification and 
the seeing required to resolve that the source is arced. 
Most of our sources are drawn from a high-redshift faint 
population that is poorly understood. Shibuy a et al. 


7.4. Definition of Detectability 

Whilst our detectability criteria are well matched to 
the CFHT lens sample, it is possible that lens finding in 
future surveys will be more difficult; pushing lens find¬ 
ing to fainter sources and smaller Einstein radii may 
introduce more false positives, forcing future lens find¬ 
ers to be more selective about what they consider to 
be a good lens candidate. We investigate three possi¬ 
ble tightenings of the detection criteria; changing the 
total signal-to-noise threshold from 20 to 40, changing 
the magnification threshold from 3 to 4 (i.e. requiring a 
more curved source) and changing the resolution thresh¬ 
old from 6 2 e > r 2 + (s/2) 2 to 0 E > r 2 + s 2 (i.e. requiring 
arcs and counter-images to be better resolved from each 
other and the lens). The number of lenses detectable in 
the DES g — ai difference images halves (down to 600) for 
the change in signal-to-noise, decreases by fifteen percent 
for the increased magnification constraint, and decreases 
by forty percent when requiring the lensed features to be 
more easily resolved. The change in signal-to-noise has 
the most significant effect, but for visual inspection of 
residuals, lensed features that are tangentially resolved 
and detected at a signal-to-noise of 20 are already unam¬ 
biguously strong lensing. 


8. CONCLUSION 

In this paper we have investigated the question ”how 
many galaxy-galaxy strong lenses can forthcoming sur¬ 
veys discover?” We have developed a realistic model for 
the lens and source populations to build a population 
of lenses that is representative of the strong lenses in 
the real Universe. We have then simulated the obser¬ 
vations of forthcoming surveys, paying particular atten¬ 
tion to the seeing distributions in each observing band 
- which we find to be a key variable in the number of 
detectable strong lenses. We have then analyzed these 
simulated observations to investigate which of the sys¬ 
tems are plausibly detectable, finding that in our most 
optimistic scenario DES, LSST and Euclid can discover 
2300, 120000, and 280000 lenses respectively, but only if 
the lens galaxy can be subtracted without degrading the 
resolution and signal-to-noise of the sample. We have 
shown that the detectability of lenses depends strongly 
on the search strategy adopted; despite the fact that 
lensed sources are typically much bluer than the lens, 
g — ai difference imaging such as implemented in Gavazzi 
et al.| (|2014|) only finds around a quarter of the optimisti¬ 
cally discoverable lenses since it degrades both the PSF 
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and the signal-to-noise. If future lens searches still use 
g — ai difference imaging but can match the PSFs to 
the worst of g or i seeing this can potentially discover 
twice as many lenses as the current foreground subtrac¬ 
tion method implemented in Ringfinder ; such a finder 
could potentially discover 1300 lenses in DES and 62000 
in LSST. The absence of color information will require 


the development of single band finders (e.g. Joseph et al. 


2014) to find lenses in Euclid. 

For LSST (and a lesser extent DES) we found that the 
majority of discoverable lenses can neither be discovered 
in the full co-add nor the best single epoch-imaging; the 
interplay between signal-to-noise and seeing mean that 
all possible (seeing-ordered) stacks must be searched to 
maximize the number of discoverable lenses. 

We have shown in Section 0 that the dominant source 
of error is the definition of a detectable lens. In future, 
our definition of detectability can be re-calibrated by run¬ 
ning lens finders on our simulated lenses like; this will 
allow us to robustly understand the selection effects of 
different lens-finders. Section 0 shows that the model it¬ 
self appears to be accurate to around a few parts in ten; 
given the uncertainty on defining detectability, this is 
sufficient for the purpose of this work, but going beyond 
this level of accuracy will need a significant improvement 
in our understanding of the faint source population and 
redshift distribution. 

One major issue that we have not addressed here is the 
question of false positives. The detectaility criteria out¬ 
lined in Section 0 seem reasonable for current datasets 
but may prove optimistic in future if greater depth and 
improved seeing reveals a large number of non-lenses that 
have morphologies similar to lenses. In particular, we 
have not required the detection of counter images, which 
may be necessary to achieve an acceptable sample pu¬ 
rity. It may also be that some regions of parameter space 
produce less pure lens samples than others; for example 
it is easier to recognize tangential shearing in thinner 
arcs; lensed low surface brightness sources may be hard 
to distinguish from face-on spiral arms or tidal features 
of a candidate lens galaxy. Spectroscopic confirmation 
of lens candidates is expensive and robust lens model¬ 
i ng presents a challenge in the face of large numbers 
(Brault & Gavazzi 2014); future searches may require 
much higher purity than current samples. The selection 
function of future lens finders may be more conservative 
than we have assumed in this work. 

We have made the code used in this work publicly avail¬ 
ably future strong lens finding algorithms can therefore 


be tested on these systems, to asses what subset of the 
full lens population they will discover. If -once the find¬ 
ers has been run on the simulated lens sample- the prop¬ 
erties of discovered lens populations deviate significantly 
from that predicted by our model it will provide evidence 
that some aspect of the model’s deflector or source pop¬ 
ulation is incorrect. Whilst strong lensing has al read y 
shed li ght on the high redshift source population (New- 
ton et ak]|2Qll|), and the mass distribution in strong len s 
galaxies (e.g. Auger et al.||2010 Sonnenfeld et al.||2013 ) 


these results may be biased by the discoverable strong- 
lens selection function. Given enough lenses and com¬ 
putational power a model such as the one presented in 
this work could plausibly be used to infer the many pa¬ 
rameters of Section [2] This would impli citly include the 


strong lens selection function (e.g. Sonnenfeld et al. 2015) 
and should give unbiased inference on the properties of 
the deflector and source populations. 

Over the next decade, surveys will have sufficient 
signal-to-noise and resolution to discover hundreds of 
thousands of galaxy-galaxy strong lenses, however no one 
human can hope to perform visual inspection on such a 
large sample. Since all current lens finders need some 
level of human inspection and extracting scientific re¬ 
sults from lenses requires human input in the form of lens 
modeling, Making full use of these hundreds of thousands 


(Marshall et al. 

2005] Brault & Gavazzi||2014, e.g.) or 

crowd-sourcing ( 

Marshall et al. 2015! More et al. 2015 

Kiing et al. 2015 

F~ 
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